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A Generalized Matrix Version of Rennie's Inequality' 

John Z. Hearon** 

(April 24, 1967) 

The matrix version of Rennie's inequality and the finite-dimensional version of Kantorovich's 
inequality are obtained by considering a positive definite matrix and its inverse. Generalizations of 
these inequalities are obtained in which the inverse matrix is replaced by a generalized inverse with 
certain prescribed properties. From the generalization of the Kantorovich inequality follows a (finite- 
dimensional) generalization of an inequality due to Strang. 
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1. Introduction 

It is the main purpose of this paper to derive an 
inequality which may be characterized as a general- 
ization of Mond's matrix version [5] 1 of Rennie's in- 
equality [7] and from which a generalization of the 
inequality of Kantorovich [4] follows. The inequality 
of Mond and some versions of the inequality of Kantor- 
ovich are derived by considering a positive definite 
matrix and its inverse. The derivation given here con- 
siders a positive semidefinite matrix and a general- 
ized inverse of that matrix. The generalized inverse is 
required to have certain prescribed properties. 

We begin with some preliminary considerations 
of generalized inverses of various classes. The main 
theorem is proved under the assumption that the 
generalized inverse, B, is in the weakest class and 
it is shown how the particular required properties 
of B may be weakened when B is in a stronger class. 

Thus far there seems to be no treatment of the case 
of strict equality for Mond's matrix inequality. In the 
case of the Kantorovich inequality the existing dis- 
cussions of strict equality are rather involved (e.g., 
see [3] and references therein). In any event the dis- 
cussion of equality is more involved than the de- 
rivation of the inequality itself. One feature of the 
derivations given here is that they enable a rela- 
tively straightforward discussion of the case of strict 
equality for both Mond's and Kantorovich's inequalities. 

2. Notation and Preliminaries 

We consider matrices with complex entries and for 
any matrix M denote by p(M), N(M), R(M) and M* 
the rank, null space, range and conjugate transpose, 
respectively, of M. As before [2] we define C\(A) to 



be trie set of all matrices B such that ABA =A, where 
A is a given, arbitrary matrix, C 2 (A) to be the set of all 
matrices B such that BeC^A) and BAB = B. We further 
define C- A (A) to be the set of all matrices B such that 
BeC>(A) and AB is a hermitian matrix, and C 4 (A) as 
the set of all matrices B such that BeC :i (A ) and BA is 
hermitian. We call a matrix Bed(A) a C-inverse of A. 
Other terminology has been used (see [9] and refer- 
ences therein and [8] and referee's suggestion and 
references therein). Note that a CVinverse is essen- 
tially 2 the weak-generalized-inverse of Goldman and 
Zelen [1] and that the C 4 - inverse is the Moore-Penrose 
generalized inverse [6]. We make repeated use of the 
following facts: If £eC,(/f), then p(B) ^ p(A) = p(AB) 
= p(BA) and AB and BA are idempotent matrices [9|. 
Moreover if BeC { (A), then BeC z (A) if and only if p(A) 
=p(B)[2,9]. 

For i = 1 or 2, a C-inverse of a hermitian matrix is 
not necessarily hermitian, but for every hermitian A 
there exists a hermitian Converse, /=1, 2. In fact 
given A = A* and any B A eC x {A) the matrix H = (B\ 
+ Z?i)/2 is in C\{A) and is hermitian [9J, and the matrix 
HAH is in C 2 (A) and is hermitian [2]. 

In what follows we need to consider the situation 
BeCi(A) and AB = BA. Although we consider only the 
case A=A* and B = B*, which is rather transparent, 
we prove the following lemma for completeness. 

Lemma 1. // N(A) = N(A*), BeC,(A) and E = AB 
= BA, then E is an orthogonal projection and x*Ex 
= x*x if and only i/xeR(A). 

Proof. Let BeCAA), then p(A) = p(AB) = p(BA) 
and if E = AB = BA it follows that N(B) ^ N(A) = N(E). 
From ABA=A, we have R(E) = R(A). If N(A) = N(A*) 
then N(A) is orthogonal to R(A) and E is a hermitian, 
idempotent matrix. Any x can be expressed as 
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2 Goldman and Zelen [1J define a weak generalized inverse B such that BeC z (4) and BA 
is hermitian. We follow Rohde [9] in defining BeC-^A) as in the text. Actually any statement 
in this paper with respect to a BeC :i {A) as defined here is true for the Goldman-Zelen weak 
generalized inverse. 
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x = u + v, with ueR(A) and veN{A). Then x*Ex = u*u. 
But u*u = x*x if and only if xeR{A). 

3. The Inequalities of Rennie, Kantorovich, 
and Strang 

We now prove the following theorem, which is 
the main result of this paper. 

THEOREM 1: Let A be a hermitian positive semi- 
definite matrix of order n and rank r. Let those roots 
of A which are strictly positive be ordered as k\ ^ X 2 
^ . . . ^ k r > and define s = Ai + A r , p=XiX r . 
If B is any hermitian C\-inverse of A which commutes 
with A and has v=r\ — r nonpositive roots, then for 
any x such that x*x= 1 we have 



and 



x*Ax + px*Bx ^ s 



(x*Ax) (x*Bx) ^ s 2 /4p. 



(i) 



(ii) 



If ki = k r =k, there is strict equality in (i) if and only 
if there is strict equality in (ii), and if and only if 
Ax=Xx. // k\ > k r , there is strict equality in (i) if 
and only if x=ui + u r , where Aui = XiUi and Au r 
= X r u r ; and there is strict equality in {ii) if and only 
if x = ui + u r and Ui*Ui ^u/ur^ 1/2. 

PROOF: Since A and B are hermitian and commute 
they have in common a complete orthonormal set of 
eigenvectors which we denote by xu indexed so that 
Axi= k&i, l^i^n. We also have Bxi = x\lki, 1 ^ i ^ r. 
Let the v remaining roots of B be — du n ^ i > r, d\ ^ 0. 
It is then clear that the matrix P = (A — kj) (A — k,I)B 
is hermitian negative semidefinite. For the vectors xt 
are eigenvectors of P with corresponding (nonpositive) 
eigenvalues jH; = (X; — Xi)(X; — X,)/X ( , 1 ^z'^r, and >lxj 
= — pdi, r<i^n. From BeCi(A) and AB = BA = E we 
have A=A 2 B = BA 2 and P may be written P = A—sE 
+ pB. For any x, x*Px ^ and if x*x= 1, then 



x*Ax + p{x*Bx) ^ s(x*Ex) ^ s. 



(1) 



Thus (i) is proved. Define ifj = x*Ax and p(x*Bx)=if. 
Then, from (1), ijj + tp ^ s and since *// ^ 0, we have 



ifnp ^si//-i// 2 =sss 2 /4 



(2) 



which gives (ii). 

Let e\ denote the condition: strict equality obtains 
in (i); and let e% denote the condition: strict equality 
obtains in (ii). 

If ki = k, = k we have to prove e 1 ^e2^Ax = \x. We 
show Ax = kx implies both e\ and e 2 : then e\=^ Ax= kx\ 
then e 2 =)ei. If Ax=kx, then it is easily verified that 
ijj = (p = k = s/2, and we have e\ and e>. Now assume 
6i, then from (1), x*Ex= 1 which, by Lemma 1, implies 
xeR(A) and x # 0. But every 7^ xeR(A) obeys Ax— kx. 
Thus e i= AAx = kx. Now assume e 2 , then from (2) we 
have i//= ip = s/2 which gives e\. Thus e 2 =)ei. This com- 
pletes the cases of strict equality when Xi =k r . 

If Xi > X r we have to show e\<^x= U\ + u T , where 
u\ and u r are any eigenvectors of A associated with 



the eigenvalues Xi and X r respectively. We also must 
show e>2&x = Ui + u r , where U\ and u r are, in particu- 
lar, normalized so that U\*u\ = u r *u r = 1/2. Suppose 
x = Ui-\-u r and U\*u\ = k\, u r *u r =kr. Then 



\fj= AiXi + k r k r 
<p = kik,-\- k r k\ 

x*Ex= ki + k r = 1 



(3) 

(4) 
(5) 



and the sum of (3) and (4) give, in view of (5), \\$ + <p = s 
which is e\. Conversely assume e\, then from (1) we 
have x*Px = 0, which requires xeN(P), and x*Ex = l, 
which requires xeR(A). One basis for N(P) consists of 
the following subset of the eigenvectors x t of A: those 
Xi with i such that X/ = Xi; those xu with k such that 
kk= X r ; and those xj (if any) withy such that kj = dj = 0. 
Thus xeN(P) can be written x = ui + u r + v with 
veN{A). But also xeR(A), which means v*x = Q and 
therefore (since v*u\ = v*u r = 0), v = 0. Thus e\€$>x = u\ 
+ a r is proved. Now assume x = U\ + w r and k\ = A 2 
= 1/2. Then (3) and (4) read \p = s/2 and <p = s/2, respec- 
tively, which gives e 2 . Conversely, assume e 2 , then 
from (2), i// = ^> = s/2, which gives ei. We have just 
seen that e\ implies x — U\-\- u r which in turn implies 
(3), (4), and (5). But when i// = ^> = s/2 any two of (3), (4), 
and (5) give k\ =k r — 1/2. This completes the proof of 
the theorem. 

In case r=n, clearly B = A~ l and from (i) of the 
theorem we recover Mond's matrix version [5] of Ren- 
nie's inequality [7]; from (ii) of the theorem we have 
the inequality of Kantorovich [4]. In this case we also 
have, from Cauchy's inequality, (x*Ax) (x*A~ x x) ^ 1. 
In case r < n, we have :i f(x) = (x*Ax)(x*Bx) ^ 1 when- 
ever xeR(A), but this inequality is in general false. 
We may infringe it with the choice x = (x\ ■+■ x n )l V 2 
which gives f(x) = (l - Xid w )/4 ^ 1/4. In fact f(x) ^ 1 
if and only if r = n. For if r=n we have just seen that 
f(x) ^ 1. If f{x) ^ 1 for all x*x = 1 then x*Ax i^ for all 
x*x = l, in particular x*Axt^0 for all i, and A is 
strictly positive definite. 

Preparatory to the discussion of alternative speci- 
fications of the matrix B in Theorem 1, we prove the 
following theorem. 

Theorem 2. Let A be a given matrix, {i) If B is a 
Ci-inverse of A and commutes with A, then E = AB is 
uniquely determined, (ii) If B is a Co-inverse of A and 
commutes with A, then B is unique and is a polynomial 
in A. If A is normal (in particular hermitian) then B 
is normal (in particular hermitian). 

Proof, (i) From ABA=A and AB = BA, we have 
BA 2 = A, which shows that p(A) = p(A 2 ). We further 
have Ex = ABx = x when xeR(A), and that Ey=--ABy 
= BAy = when yeN(A). Since p{A) = p(A 2 ) requires 
that R(A) and N(A) be disjoint subspaces, E is uniquely 
determined by A. (ii) Let B, and B 2 be (^-inverses which 
commute with A. From (i), E = BiA = B 2 A and hence 



3 If xeR(A), then x = ^ a iXi . Thus 

/W-(i|^ i *i)(i|«| , /fc)>l 
follows from Cauchy's inequality. 
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Ba = B 2 x when xeR(A). Since [9, 2], p(A)^p(AB ] ) 
= p(AB>) = p(B l ) = p(B>), it follows from E = AB { 
= AB> = B X A that N(Bi) = N(B 2 )=N(A) and hence that 
B l y = B>y = when yeN{A). Thus B { =B>. From pM) 
= p04 2 ) we know that A = Q l diag (/*i, #)(>, where 
p(y4i) = p(^4) and A\ is nonsingular and is a square 
zero matrix. Let p(t) be the polynomial such that p{A\) 
=AlK Then p(A) = Q~ l diag (/*t\ K)Q = G, where 
K = p{0), is a d-inverse of /I. For, G4 is idempotent 
and p(GA) = p{A) and this is known [2] to be necessary 
and sufficient for GeC\(A). Moreover, GAG = B is a 
polynomial in A, and [2] further BeC>(A). If A is normal 
clearly B is normal. If A=A* then the coefficients of 
p(A) are real and B is hermitian. 4 

In the proof of Theorem 2, we have seen that 
BeCi{A) and AB = BA together imply p(A) = p(A 2 ). 
Given this rank condition on A, we have seen how to 
construct a polynomial in A which is a C\ -inverse of 
A and, from this, a polynomial in A which is a C 2 -inverse 
and uniquely determined among those BeC\(A) which 
commute with A. We set this forth as the following 

COROLLARY: There exists matrices B€Ci(A) which 
commute with A if and only if p(A) = p(A 2 ). In this 
case, there exists a unique BeC 2 (A) which is a polyno- 
mial in A. 

The condition in Theorem 1 that v roots of B are 
nonpositive cannot be omitted if p(B) > p(A). For if 
x = (x l + x n )/X / 2~ we have x*Bx = (l-kidn)l2ki. If 
d n < 0, B has a strictly positive root and both (i) 
and (ii) are violated when — d n > (s 2 — p)/\\p. 

If in Theorem 1 we specify that BeC>{A), then the 
condition that B be hermitian and the condition on 
the roots may be dropped. For, by Theorem 2, if 
BeCi{A) and commutes with A then B is a polynomial 
in A. Further since A is hermitian, that polynomial is 
hermitian. But then we have BeC 4 (A), since A and 
B are hermitian and commute and AB = BA is hermi- 
tian. If we specify BeC 4 {A) then the theorem holds 
true with no further conditions on B. For, if BeC 4 {A) 
and A = A* it is known [6] that B = B* and AB = BA. 
If we require BeC^A) and B = B*, then no further 
conditions on B are required for the validity of the 
theorem. For, if BeC^A), A=A* and B = B* then in 
fact BeC 4 (A), since {AB)* = AB implies BA is hermitian. 

If we consider the conditions of Theorem 1: (a) B 
is hermitian, (b) B commutes with A, and (c) B has 
v= n — r nonpositive roots, then we have the following 
summary statement: Theorem 1 remains valid when 
the statements there regarding B are replaced by 
BeC>{A) and (b): and when they are replaced by 
BeCi(A) and (a); and when they are replaced by 
BeC 4 (A). 

It is of interest to note that if in Theorem 1 the con- 
ditions on B are replaced by the conditions that B is 
normal and BeCAA), then the theorem is valid. This 
follows from 

Lemma 2. If A is normal and BeC^A), then A and B 
commute if and only ifB is normal and if and only if 
B€C 4 (A). 

4 It is also clear that we have B = Q~ l diag (A ~ ' 6)0. For this matrix is clearly a C ..-inverse 
which commutes with A and we have just shown that there is only one such matrix, trom 
this it follows that A normal (in particular hermitian) implies B normal (in particular her- 
mitian) since in these cases Q is unitary and A\ is normal (in particular hermitian). 



PROOF. Let A be normal, p{A) = r and Axi = \iXu 
h # 0, 1 ^ i ^ r. Then BeC A (A) ^ABA =A ^ABx< 
= xi. But BeC,(A) =$AB = (AB)* and thus we have x,*AB 
= x? or Xj*B = Xj*l\j, for 1 ^ i ^ r. If we write E = AB 
= £* then since p(E) = p(A) = p(B) we have N(E) 
= N(A) = N(B). If B is normal then N(B) = N(B*) and 
we have thus proved that A and B have a complete 
set of orthonormal eigenvectors in common and hence 
they commute. Now let AB = BA. Then since AB is 
hermitian by hypothesis, BA is hermitian and BeC 4 {A), 
further by a known theorem [6] B is normal. 

It is clear from Lemma 2 that if A is hermitian and 
BeCiA) is normal, then B is hermitian, for as we have 
seen BeC 4 (A) and this implies [6] that B = B*. Thus if 
BeCs(A) and is normal no other conditions on B are 
necessary for the validity of Theorem 1. 

Finally, by use of (i) of Theorem 1 we can generalize 
the finite-dimensional version of a result of Strang [10] 
(this possibility was pointed out to the author by John 
W. Evans). 

THEOREM 3. Let T be an arbitrary square matrix 
of rank r. Let those singular values of T which arc 
strictly positive be ordered as K\ ^ \> ^ • . . ^ A>- >0 
and define s = Ai + A. r , p = \iA r . If L is the C 4 -inverse 
o/T, then, for all x, y 

|(y*Tx)(y*L*x)|^^-(x*x)(y*y). (6) 

PROOF. Let T=QA be the polar factorization of T, 
where Q is unitary and A is positive semidefinite. If 
B is the Converse of A then L = BQ* is the Converse 
of T. More, B is positive semidefinite and commutes 
with A. For it is known [6] that B is hermitian and com- 
mutes with A and hence [9], as seen in the proof of 
Lemma 1, that the nonzero roots of B are reciprocals 
of the nonzero roots of A. Thus A and B meet the con- 
ditions of Theorem 1. Let x and y be any vectors and 
define z = Q*y. Then by the Schwartz inequality, 



(y*Tx) {y*L*x)\ = \(z*Ax) {z*Bx)\ 



. [(x*Ax) (z*Az) (x*Bx) (z*Bz)yi 2 . 



(7) 



The singular values of T are the roots of A and (ii) of 
Theorem 1 can be written, for any x, as (x*Ax) (x*Bx) 
^ (5 2 /4p) (x*x) 2 . From this, (7) and z*z = y*y we obtain 
(6). 

Remark: We cannot in Theorem 3, specify that L 
be a generalized inverse of T weaker than C 4 {T). It 
is a fact that if T= QA , Q is unitary, and L = BQ*, then 
LeCiT) if and only if BeCi(A), 1 ^ i ^4. But it is not 
enough to have B meet the conditions of Theorem 1. 
For in using the Schwartz inequality to obtain (7) we 
additionally require that B be positive semidefinite. 
This requires that at least BeCi{A) and, as we have 
seen, this results in BeC 4 (A). For if B meets the con- 
ditions of Theorem 1 and is positive semidefinite, then 
the v nonpositive roots are in fact zero, so that p(A) 
= p(B) and BeC 2 (A). 

Addendum: The author's attention has been directed 
(by A. S. Householder) to the theorem of Diaz and 
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Metcalf [Bull. Am. Math. Soc. 69, 415-418 (1963), 
theorem 3, p. 417] which includes as a special case a 
Hilbert space version of Rennie's inequality [7]. They 
give a succinct statement of the condition for strict 
equality which, in the terminology of the proof of 
Theorem 1 of this paper, is: We have e\ if and only 
if xeN(P). 
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